biodiversity data

The legacy of impactful biodiversity research: Pensoft at Living Data 2025

Events like these continue to be of great significance for Pensoft as it works to innovate the landscape of academic data management and scientific outreach.

Effective biodiversity conservation at the global level requires consolidated, streamlined and open scientific data to support it. This was the tenet at the heart of Living Data 2025, a conference unprecedented in its scale and ambition to foster a transcontinental dialogue on the past, present and future of research into the biosphere.

The event took place between 21 and 24 October in Bogotá, Colombia, and was made possible via an extensive collaboration between the biodiversity networks GBIF, TDWG, OBIS and GEO BON, with support from the Humboldt Institute.

With an audience spanning the globe and a four-day agenda reflecting the diversity of innovations and challenges to be addressed in this context, the scene was set for an inclusive and productive dialogue on biodiversity data.

For its part, Pensoft seized the opportunity to join this crucial forum. Represented by founder and CEO Prof. Lyubomir Penev, CTO Teodor Georgiev and Science Communication Expert Peter Bozakov, the open-access scholarly publisher and technology provider became an active participant in the programme as:

Three men pose in front of a colorful backdrop featuring tropical plants and the event title "Datos Vivos 2025" in Bogotá, Colombia. — Pensoft’s Chief Technology Officer Teodor Georgiev, Science Communication Expert Peter Bozakov, and founder and Chief Executive Officer Prof. Lyubomir Penev

Еxhibitor on the conference floor

Pensoft’s representatives were front and centre at the event by virtue of a dedicated booth showcasing the company’s work in academic publishing and science communication, as well as FAIR biodiversity data innovation. A wide array of materials was available for researchers to browse through, reflecting a variety of scientific subjects and endeavours. The ensuing conversations reflected a shared commitment to a more ambitious biodiversity research landscape today and tomorrow, as the parties charted potential avenues for cooperation.

A booth at a conference displays biodiversity research materials and promotional items, with two men posing. — Pensoft’s stand at Living Data 2025.

Sponsor of the Best Student Presentation award

Unwavering in its support for young scientists and early-career researchers, Pensoft also left a mark with its sponsorship of the most critically acclaimed student oral talk delivered at Living Data 2025. During the conference’s closing ceremony, Prof. Lyubomir Penev delivered the award to Mélisande Teng for her presentation, titled “A machine learning approach to species distribution modelling using remote sensing and citizen science data“. This distinction entitles her to a free publication in one of the journals in Pensoft’s extensive and exclusively open-access portfolio.

A speaker stands behind a podium at a conference with a presentation backdrop showcasing various partners, including logos and event details. — Prof. Penev presenting the Best Student Presentation award

Co-organiser of a symposium

Last but not least, Pensoft drew on its experience across its multiple expertises to address some of the topical pillars of the event in its own symposium. The publisher and technology provider was joined in this effort by long-standing partners from LifeWatch ERIC (represented by its CEO Christos Arvanitidis) and the Naturalis Biodiversity Centre (represented by Niels Raes).

Together, they delivered two sessions sharing the title “Long Live Biodiversity Data: Knowledge Transfer and Continuity across Research Projects”. In that sense, the aim was to emphasise the importance of science results being repurposed and reused, finding new life beyond the endeavours that gave rise to them. The role of open data, targeted communication and clearly defined pathways to impact in decision-making was singled out as an essential aspect on the road to such long-lived outputs.

Both sessions attracted the attention of attendees, leading to proactive engagement with the topics in focus.

A number of ongoing projects and initiatives – where Pensoft has been involved as an active consortium partner – were in the spotlight, including Biodiversity Meets Data, B-Cubed, OneSTOP, BioAgora, FORSAID, WildPosh, IP4OS and GATE. Special mention was also afforded to SOLO and eLTER, as well as the concluded BiCKL, EuropaBON, HOMED and PoshBee.

Together with a number of other fellow projects, they provided inspiring testaments to the potential of results to grow beyond the vision they first emerged out of. Overall, the symposium brought together 16 abstracts with over 90 contributing authors, more than 20 initiatives and more than 30 affiliated institutions and organisations. The recordings of Session #1 and Session #2 are already available on YouTube.

Later this year, extended abstracts presented throughout the Living Data 2025 conference will be published in the open-access journal Biodiversity Information Standards and Science (BISS): the official scholarly outlet of TDWG launched in 2017 in partnership with long-term collaborator Pensoft. Initiated by a dedicated call from TDWG, this year’s extended abstracts collection will provide further insight into the perspectives, opportunities and issues discussed in the respective showcases.

All in all, the conference was a noteworthy milestone for the international biodiversity community – an exchange of views, results and opportunities at a broad geographical and multidisciplinary scale that is truly oriented towards tangible outcomes for the planet’s future. As ever, formats like these continue to be of great significance for Pensoft as it works to innovate the landscape of academic data management and scientific outreach across and beyond borders.

Relive highlights of the conference on Bluesky and LinkedIn using the hashtag #LivingData2025.

Did you know that three years ago Pensoft hosted the TDWG annual conference? Check out the highlights on our blog!

Scientists call for a global alliance to place biodiversity at the heart of the UN Pact for the Future

A new white paper delivers a clear message: protecting biodiversity is not just an environmental issue. It is essential for food security, public health, climate stability, and the global economy.

A new white paper: “From Knowledge to Solutions: Science, Technology and Innovation in Support of the UN SDGs”, published in the open-science scholarly journal Research Ideas and Outcomes (RIO), brings together leading voices from Europe’s biodiversity and data science communities to deliver a clear message: protecting biodiversity is not just an environmental issue. It is essential for food security, public health, climate stability, and the global economy.

The authors make a call for a decisive shift: from fragmented initiatives to a holistic, global approach to biodiversity research and policy, already demonstrated during a workshop at the 79th United Nations General Assembly and the Science Summit (UNGA79). A key part of this transformation concerns the role of research infrastructures in connecting science, technology, and policy: from vast biodiversity collections and genomic observatories, to ecosystem “digital twins” powered by supercomputers.

Behind the paper are a network of legal entities based in Europe and holding global interests, which includes biodiversity, ecology, and engineering communities, coordinated by the LifeWatch European Research Infrastructure Consortium (ERIC).

With their combined expertise and through European initiatives, such as Research Infrastructures, e-Infrastructures, the European Open Science Cloud (EOSC), the Digital Twin projects and academic publishers, these communities provide a basis for collaboration in strategically contributing to the implementation of the Kunming-Montreal Global Biodiversity Framework (K-M GBF) targets.

Biodiversity needs to be placed at the centre of the upcoming 2026 UN Summit of the Future and become a core pillar of the agenda after the 2030 deadline for the United Nations Sustainable Development Goals (UN SDGs).

The UN Pact for the Future should include biodiversity as a core pillar: “not only of environmental sustainability, but of equity, security, and intergenerational justice”.
urges the team.

To do this, the authors propose the establishment of a global alliance that will strategically integrate biodiversity conservation into the core priorities of the UN Summit of the Future and the post-SDG agenda.

This alliance is meant to join the voices of researchers, policymakers, indigenous knowledge holders, civil society, and industry to ensure that biodiversity underpins peace, prosperity, and justice as a universal enabler.

The white paper also demonstrates how the research infrastructures collectively contribute to the seven Strategic Considerations of the K-M GBF, outlined here in brief and further detailed in the full publication:

Contribution and rights of Indigenous Peoples and local communities: Ensuring fair recognition and sharing of benefits with indigenous peoples and local communities, thus integrating their knowledge into biodiversity science.
Collective efforts towards the targets of the K-M GBF: Coordinating biodiversity monitoring, databases, and digital infrastructures to track progress towards global conservation targets.
Fulfilment of the three principal objectives of the Convention on Biological Diversity (CBD) and its protocols: Studying or supporting the study of all aspects of biodiversity; and providing public and streamlined access to biodiversity information.
Implementation through science, technology, and innovation: Developing and offering technologically advanced and novel solutions for research, data sharing and management to various users; and promoting open science by publishing research findings and increasingly sharing more facets of the research process.
Ecosystem approach: Developing and implementing technologies that enable a cross-domain, multidisciplinary approach to studying biodiversity and ecosystems; and using holistic, cross-disciplinary methods to understand and predict biodiversity and environmental dynamics.
Cooperation synergies: Collaborating with organisations responsible for implementing the CBD, policy agents, international research projects; and participating in international forums and social, scientific and technical initiatives.
Biodiversity and health linkages: Demonstrating how healthy ecosystems support human health, food security, and resilience to pandemics by supporting interdisciplinary research through bringing together knowledge and data and uncovering links and interactions between humans and the environment.

“With the UN’s ‘Pact for the Future’ currently being shaped, we see a unique opportunity to anchor biodiversity as a unifying thread across global goals that will transform how societies respond to the intertwined crises of climate change, nature loss, and pollution,” say the authors.

The white paper is the latest contribution to the LifeWatch ERIC Strategic Working Plan Outcomes open-science collection meant to provide a one-stop access point to the most important deliverables by the European biodiversity and ecosystem research infrastructure, which is currently undergoing a significant upgrade as a response to the needs of its target communities and stakeholders.

***

Original source:

Arvanitidis C, Barov B, Gonzalez Ferreiro M, Zuquim G, Kirrane D, Huertas Olivares C, Drago F, Pade N, Basset A, Deneudt K, Koureas D, Manola N, Mietchen D, Casino A, Penev L, Ioannidis Y (2025) From Knowledge to Solutions: Science, Technology and Innovation in Support of the UN SDGs. Research Ideas and Outcomes 11: e168765. https://doi.org/10.3897/rio.11.e168765

This publication is part of a collection:

LifeWatch ERIC Strategic Working Plan Outcomes Edited by Christos Arvanitidis, Cristina Huertas, Alberto Basset, Peter van Tienderen, Cristina Di Muri, Vasilis Gerovasileiou, Ana Mellado

***

About the contributing organisations:

LifeWatch ERIC

Europe’s biodiversity and ecosystem research infrastructure. LifeWatch ERIC provides access to biodiversity and ecosystem data, services and other research products: its virtual workbenches and digital twins for biodiversity science enable researchers worldwide to analyse biodiversity patterns, processes, and changes in ecosystems, and derive evidence-based knowledge for science and policy.

CSC – IT Center for Science

CSC hosts one of the world’s most powerful supercomputers (LUMI), pioneering biodiversity digital twins and climate models. CSC provides critical support for data-intensive projects that link computing, AI, and environmental science.

EGI Federation

A federation of hundreds of data centres providing global-scale computing, AI, and data services. EGI enables large-scale analysis of biodiversity and environmental data from sensors and satellites, supporting international collaboration.

VLIZ – Flanders Marine Institute

A hub for marine research, coordinating Europe’s Digital Twin of the Ocean and global biodiversity data systems, such as WoRMS (World Register of Marine Species). VLIZ drives blue innovation and ocean data integration.

The European Marine Biological Resource Centre (EMBRC-ERIC)

Europe’s infrastructure for marine biology, offering access to organisms, labs, and genomic observatories. EMBRC connects over 70 institutes across 10 countries, supporting research “from genes to ecosystems.”

The Distributed System of Scientific Collections (DiSSCo)

The largest initiative to digitise and unify Europe’s natural science collections into a single, FAIR-data-based infrastructure. DiSSCo makes museum collections globally accessible, boosting taxonomic, ecological, and environmental research.

OpenAIRE

A European e-Infrastructure dedicated to building a globally connected, interoperable, and sustainable open research ecosystem, with Open Science at its core. By offering a suite of services covering the entire research lifecycle, guidelines, and practices that support the adoption of Open Access and FAIR data principles across its network of National Open Access Desks in 34 countries, OpenAIRE supports local researchers, funders, and policymakers in aligning with European and global open science policies.

Pensoft

Founded in 1992 “by scientists, for scientists”, the academic open-access publishing company is well known worldwide for its novel cutting-edge publishing tools, workflows and methods for text and data publishing of journals, books and conference materials. Through its Research and Technical Development department, the company is involved in various research and technology projects. Pensoft coordinated the EU project BiCIKL (2021-2024), which established a new community of Research Infrastructures and users of FAIR and interlinked biodiversity data.

The Association for Computing Machinery (ACM)

The world’s largest computing society, established to foster ethical and responsible innovation. ACM brings global expertise in computing and AI to biodiversity research and policy.

Athena Research Centre

A leading ICT and AI research institute advancing digital infrastructures and open science platforms. Athena connects computing innovation with biodiversity, humanities, and societal challenges.

Pensoft to co-host a session on knowledge transfer & continuity at Living Data 2025

Pensoft is a co-organiser of a four-hour session, titled: “Long Live Biodiversity Data: Knowledge Transfer and Continuity across Research Projects”.

In October 2025, four major institutions in the biodiversity research landscape: TDWG, GBIF, OBIS and GEO BON, will come together as the organisers of the Living Data 2025 conference.

The event is set to be among one of the most crucial international gatherings of the year for experts and stakeholders in the field of biodiversity data. Set to take place in the Colombian capital of Bogotá between 21^st and 24^th, Living Data 2025 will centre around four core themes:

Open data
Data integration
Biodiversity data application
Community engagement and capacity-building

As an academic publisher with experience and commitment to all these thematic areas, Pensoft will participate in the event in the capacity of an exhibitor and an award sponsor, as well as a symposium host.

The conference delegates will have the chance to learn more about the publisher, its exclusively open-access scholarly portfolio and participation at various international scientific projects when they visit the company’s branded stand.

During the event, the scientific publisher and technology provider will also present the Pensoft Award for the Best Student Oral Presentation, which grants the winner a free publication in an open-access, peer-reviewed journal from our portfolio.

Crucially, Pensoft’s involvement in the Living Data 2025 programme also includes a dedicated four-hour session titled “Long Live Biodiversity Data: Knowledge Transfer and Continuity across Research Projects”.

The symposium will be jointly co-organised by Pensoft, LifeWatch ERIC and the Naturalis Biodiversity Centre. As the title suggests, the session will focus on the longevity of scientific outputs as they are generated, shared and re-used across disciplines, organisations and initiatives. In this context, tools, information hubs and workflows enabling exchanges that truly consolidate the global biodiversity data space over time will be showcased.

In a broader sense, the session will also seek to demonstrate how targeted communication can help transform science results into actionable knowledge by raising awareness among agenda-setters. This will speak to the potential of a multi-level approach to information sharing to bridge the gap between science and policy in relation to increasingly ambitious global environmental objectives.

Multiple projects affiliated with Pensoft will be represented in these deliberations, in order to share a diverse array of relevant insights:

The symposium will be divided into two sessions:

22 October (Wednesday): 10:45 AM – 12:45 PM (UTC/GMT-5)
23 October (Thursday): 10:45 AM to 12:45 PM (UTC/GMT-5)

You can find out more about Living Data, including the details on registering for an in-person or virtual attendance, on the conference’s website. Our session is listed on this page under ID number 6788879.

As an additional note, the organisers of the conference have launched a call for extended abstracts for all speakers at Living Data 2025 that will remain open until 1^st October 2025. The participants who opt to publish their conference abstracts in the Biodiversity Information Science and Standards (BISS) journal will enjoy permanent and far-reaching accessibility and discoverability for their conference contributions.

The TDWG network, who launched BISS as their official scholarly outlet in 2017 in collaboration with long-time partner Pensoft, have posted a list of the advantages for submitting an extended abstract, even though they have already had their abstracts accepted by the Living Data 2025 organisers. Amongst the reaslons are many perks typically associated with a conventional research article, such as DOI registration, indexation at dozens of scientific databases, embedded media, tables and supplementary materials, and usage metrics.

Mining nature’s knowledge: turning text into data

By using natural language processing, researchers created a reliable system that can automatically read and pull useful data from thousands of articles.

Guest blog post by Joseph Cornelius, Harald Detering, Oscar Lithgow-Serrano, Donat Agosti, Fabio Rinaldi, and Robert M Waterhouse

In a groundbreaking new study, scientists are using powerful computer tools to gather key information about arthropods—creatures like insects, spiders, and crustaceans—from the large and growing collection of scientific papers. The research focuses on finding details in published texts about how these animals live and interact with their environment. By using natural language processing (a type of artificial intelligence that helps computers understand human language), the team created a reliable system that can automatically read and pull useful data from thousands of articles. This innovative method not only helps us learn more about the variety of life on Earth, but also supports efforts to solve environmental challenges by making it easier to access important biological information.

Illustration depicting species literature feeding data on arthropod traits into a database, linking researchers and the community. — Mining the literature to identify species, their traits, and associated values.

The challenge

Scientific literature contains vast amounts of essential data about species—like what arthropods eat, where they live, and how big they are. However, this information is often trapped in hard-to-access files and old publications, making large-scale analysis almost impossible. So how can we convert these pages into usable data?

The goal

The team set out to develop an automatic text‑mining system using Natural Language Processing (NLP) and machine learning to scan thousands of biology papers and extract structured information about insects and other arthropods to build a database linking species names with traits like “leg length” or “forest habitat” or “predator”.

How it works in practice

Collect curated vocabularies of terms to be searched for in the texts:

~1 million species names from the Catalogue of Life
390 traits, categorised into feeding ecology, habitat, and morphology

Create “Gold‑standard” data needed to train language models:

Experts manually annotated 25 papers—labelling species, traits, values, and their links—to use as a training benchmark

Train NLP models so they “learn” which are the terms of interest:

Named‑Entity Recognition using BioBERT for identifying species, trait, and value words or phrases in the texts
Relation Extraction using LUKE to link the words/phrases e.g. “this species has this trait” and “this trait has this value”

Automated extraction of words/phrases and their links:

Processed 2,000 open‑access papers from PubMed Central
Identified ~656,000 entities (species, traits, values) and ~339,000 links between them

Publish results in an open searchable online resource:

Developed ArTraDB, an interactive web database where users can search, view, and visualise species‑trait pairs and full species‑trait‑value triples

Text-mining is a conceptually and computationally challenging task.

What is needed for the next steps

Annotation complexity: Even experts struggled to agree on boundaries and precise relationships, underscoring the need for clearer guidelines and more training examples to improve the performance of the models
Gaps in the vocabularies of terms: Many were unrecognised due to missing synonyms, outdated species names, and variations in phrasing. Expanding vocabularies will help improve the ability to find the species, traits, and values
Community curation: Planned features in ArTraDB will allow scientists and citizen curators to improve annotations, helping retrain and refine the models over time

How it impacts science

Speeds up research: Scientists can find species‑trait data quickly and accurately, boosting studies in ecology, evolution, and biodiversity
Scale and scope: This semi‑automated method can eventually be extended well beyond arthropods to other species
Supports global biodiversity efforts: Enables creation of large, quantitative trait datasets essential for monitoring ecosystem changes, climate impact, and conservation strategies

Illustration of a butterfly with icons and arrows outlining key biological data: barcode, genome, distribution, nutrition, habitat, and more. — A long-term vision to connect species with knowledge about their biology.

The outcomes

This innovative work demonstrates how combining text mining, expert curation, and interactive databases can unlock centuries of biological research. It lays a scalable foundation for building robust, open-access trait databases—empowering both scientists and the public to explore the living world in unprecedented ways.

Research article:

Cornelius J, Detering H, Lithgow-Serrano O, Agosti D, Rinaldi F, Waterhouse R (2025) From literature to biodiversity data: mining arthropod organismal traits with machine learning. Biodiversity Data Journal 13: e153070. https://doi.org/10.3897/BDJ.13.e153070

Bulgaria joins the Global Biodiversity Information Facility (GBIF)

Led by Pensoft and its CEO Prof. Lyubomir Penev, the partnership marks a major step for Bulgarian science and regional biodiversity leadership.

Bulgaria officially joins the Global Biodiversity Information Facility (GBIF). This major event for Bulgarian science was initiated by a memorandum signed by the Minister of Environment and Water: Manol Genov.

GBIF is an international network and data infrastructure funded by governments around the world that provides international open access to a modern and comprehensive database of all species of living organisms on the planet.

Joining GBIF is an important step for initiatives such as the Bulgarian Barcode of Life (BgBOL), as it will facilitate the integration of genetic data on species diversity into the global scientific community and support the creation of a more accurate and accessible bioinformatic database. This will increase the scientific visibility and relevance of Bulgarian efforts in molecular taxonomy and conservation.

Newly established Bulgarian Barcode of Life to support biodiversity conservation in the country

World map showing GBIF network participants: green for voting participants, blue for associate participants, gray for non-participants. — Prof. Lyubomir Penev

“First of all, I’d like to congratulate all fellow scientists working in the domain of biology and ecology in Bulgaria with this wonderful achievement,” says Prof. Dr. Lyubomir Penev, founder and CEO of the scientific publisher and technology provider Pensoft, as well as a key participant in the talks and preparations for Bulgaria’s joining GBIF. He is also Chair of BgBOL.

“Becoming a full member of GBIF has been a long-anticipated milestone we have discussed and worked on for several years. Coming not long after we initiated the Bulgarian Barcode of Life, the Bulgarian membership in GBIF gives us yet another uncontested evidence that the nation is on the right path to preserving our uniquely rich fauna and flora,” he adds.

Pensoft is looking forward to sharing our know-how with Bulgarian institutions and scientists in order to streamline the visibility and overall efficiency of biodiversity data collected from Bulgaria.
Prof. Lyubomir Penev

“As close partners of GBIF for over 15 years now, Pensoft is looking forward to sharing our know-how with Bulgarian institutions and scientists, so that they can fully utilise the GBIF infrastructure and tools, in order to streamline the visibility and overall efficiency of biodiversity data collected from Bulgaria.”

GBIF is managed by a Secretariat based in Copenhagen and brings together countries and organisations that collaborate through national and institutional coordinators (also called participant nodes). The mechanism provides common standards, good practices and open access tools for institutions around the world to share information on the location and recording of species and specimens. According to GBIF, a total of 107 countries and organisations currently participate in the network, a significant number of which are European.

The GBIF network, as screenshot from https://www.gbif.org/the-gbif-network on 10/06/2025.

By joining GBIF, biodiversity data generated in Bulgaria can be streamlined through the network’s infrastructure so that the country does not need to build and maintain its own separate infrastructure, which also saves significant financial resources.

As a full voting member, Bulgaria will ensure that biodiversity data in the country will be shared and accessible through the platform, and will contribute to global knowledge on biodiversity, respectively to the solutions that will promote its conservation and sustainable use.

Map of Bulgaria showing biodiversity data with orange heatmap indicating occurrences. — Bulgaria’s page on GBIF, as screenshot from https://www.gbif.org/country/BG/summary on 10/06/2025.

Improvements in data management by Bulgaria will also contribute to better reporting and fulfilment of obligations to the Convention on Biological Diversity (CBD) as well as to the Intergovernmental Platform on Biodiversity and Ecosystem Services (IPBES). As a member of GBIF, Bulgaria will be able to apply for funding for flagship activities in Bulgarian institutions and neighbouring Balkan countries. This will enable the country to expand its leadership role in the Balkans in biodiversity research and data accumulation.

GBIF and Pensoft signed a Memorandum of Cooperation

The partnership between GBIF and Pensoft dates back to 2009 when the global network and the publisher signed their first Memorandum of Understanding intended to solidify their cooperation as leaders in the technological advancement relevant to biodiversity knowledge. Over the next few years, Pensoft integrated its whole biodiversity journal portfolio with the GBIF infrastructure to enable multiple automated workflows, including export of all species occurrence data published in scientific articles straight to the GBIF platform. Most recently, over 20 biodiversity journals powered by Pensoft’s scholarly publishing platform ARPHA launched their own hosted portals on GBIF to make it easier to access and use biodiversity data associated with published research, aligning with principles of Findable, Accessible, Interoperable, and Reusable (FAIR) data.

Journals published on ARPHA now archived in the Biodiversity Heritage Library

To date, the content available on BHL includes 16,000 legacy articles and also extends to future articles.

Content from more than 30 biodiversity journals published on the ARPHA Platform will now be archived in the Biodiversity Heritage Library (BHL), the world’s largest open-access digital library for biodiversity literature and archives.

A global consortium of natural history, botanical, research, and national libraries, BHL digitises and freely shares essential biodiversity materials. A critical resource for researchers, it provides vital access to material that might otherwise be difficult to obtain.

Under the agreement, over 16,000 articles published on Pensoft’s self-developed ARPHA Platform are now available on BHL. Both legacy content and new articles are made available on the platform, complete with full-text PDFs and all relevant metadata.

Thanks to this integration, content in our journals will become even more accessible and readily discoverable, helping researchers find the biodiversity information they need.
Prof. Lyubomir Penev

More content published on ARPHA will gradually be added to the BHL archive.

The publications will be included in the Library’s full-text search, allowing researchers to easily locate relevant biodiversity literature. Crucially, the scientific names within the articles will be indexed using the Global Names Architecture, enabling seamless discovery of information about specific taxa across the BHL collection.

This automated workflow is facilitated by the ARPHA platform and uses the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to enable exposure and harvesting of repository metadata.

“Pensoft is pleased to collaborate with BHL in our joint mission to support global biodiversity research through free access to knowledge. Thanks to this integration, content in our journals will become even more accessible and readily discoverable, helping researchers find the biodiversity information they need,” said Prof. Lyubomir Penev, CEO and founder of Pensoft and ARPHA.

The news comes soon after BHL announced it is about to face a major shift in its operation. From 2026, the Smithsonian Institution – one of BHL’s 10 founding members – will cease to host the administrative and technical components of BHL. As the consortium explores a range of options, the BHL team is confident that “the transition opens the door to a reimagined and more sustainable future for BHL.”

Biodiversity Knowledge Hub makes an appearance at the European Geosciences Union General Assembly 2025

The Biodiversity Knowledge Hub fosters interoperability between diverse resources to make it easier to use and combine information.

*Gabriel Peoluze (LifeWatch ERIC) presents the Biodiversity Knowledge Hub poster at EGU 2025*
*(Vienna, Austria).*

On Monday, 28 April, the first day of the European Geosciences Union General Assembly 2025 (EGU 2025), participants had the chance to discover one of the most promising initiatives in biodiversity informatics: the Biodiversity Knowledge Hub (BKH). BKH was presented as part of a dedicated poster session, titled “Biodiversity Knowledge Hub: Addressing the impacts of environmental change by linking Research Infrastructures, Global Aggregators and Community Networks“.

Understanding and addressing the impacts of environmental change on biodiversity and ecosystems demands access to reliable FAIR data (as in Findable, Accessible, Interoperable, Reusable). However, the current landscape is often fragmented, making it difficult to combine and use these resources effectively.

Enter the Horizon-funded project Biodiversity Community Integrated Knowledge Library (BiCIKL): a pioneering initiative that demonstrates the transformative power of interdisciplinary collaboration. Coordinated by Pensoft, BiCIKL ran between 2021 and 2024.

New BiCIKL project to build a freeway between pieces of biodiversity knowledge

The Vision of BiCIKL

Within BiCIKL, 14 European institutions from ten countries teamed up with the aim to integrate biodiversity data across research infrastructures, scientific repositories, and expert communities.

Through this integration, BiCIKL bridged the gap between isolated knowledge systems and delivered actionable insights to guide conservation and resilience efforts. The project embodies the principles of open science by demonstrating how interdisciplinary collaboration can turn fragmented data into cohesive, usable knowledge for researchers, policymakers, and practitioners.

How to ensure biodiversity data are FAIR, linked, open and future-proof?

The Biodiversity Knowledge Hub

At the heart of BiCIKL’s success is the Biodiversity Knowledge Hub (BKH): an innovative platform that provides seamless access to biodiversity data, tools, and workflows. The BKH fosters interoperability between diverse resources, thus making it easier to combine information from different sources. Whether for advanced research analytics or policymaking in support of sustainable development, the BKH empowers users with tools tailored to their needs.

A few of the standout features of the BKH include:

Modular design to allow continuous expansion and adaptability to new challenges in biodiversity and climate resilience
Interoperable systems that connect a variety of databases, repositories, and services to deliver integrated knowledge.
Community building by welcoming a broad network of stakeholders to ensure the platform’s long-term sustainability and growth.

Watch the Biodiversity Knowledge Hub video on YouTube.

Setting a New Benchmark in Biodiversity Informatics

Through its collaborative approach, BiCIKL set a new standard for how biodiversity and climate resilience initiatives can be harmonised globally. By showcasing best practices in data integration, capacity building, and stakeholder engagement, BiCIKL became much more than a project: it turned into a blueprint for future biodiversity knowledge infrastructures.

The Biodiversity Knowledge Hub serves to demonstrate how harmonised standards and active collaboration are key to unlocking the full potential of biodiversity data. In doing so, its mission is to create scalable, long-term solutions that are crucial for addressing today’s pressing environmental challenges.

The poster presentation at EGU25 outlined the methodologies and technologies driving the BKH, emphasizing its role as a pioneering model for integrated biodiversity knowledge and action. As environmental pressures continue to mount, the work of BiCIKL and the Biodiversity Knowledge Hub offers a hopeful path forward—one where knowledge flows freely, collaborations flourish, and data-driven solutions guide our way to a more resilient future.

Visit the Biodiversity Community Integrated Knowledge Library (BiCIKL) project’s website at: https://bicikl-project.eu/.

Don’t forget to also explore the Biodiversity Knowledge Hub (BKH) for yourself at: https://biodiversityknowledgehub.eu/ and watch the BKH’s introduction video.

Revisit highlights from the BiCIKL project on X/Twitter from the project’s hashtag: #BiCIKL_H2020 and handle: @BiCIKL_H2020.

BiCIKL project sums up outcomes and future prospects at a Final GA in Cambridge

Pensoft joins the Biodiversity Meets Data Horizon project to support biodiversity monitoring and conservation

As part of the new consortium, Pensoft is to use innovative communication tools in support of evidence-based biodiversity conservation across Europe.

The European Union (EU) has been working to protect nature for decades, with the Natura 2000 network now safeguarding over 18% of EU land and 9% of its marine territory. Yet, biodiversity is still in trouble, with only 50% of bird species and 15% of habitats in good conservation status.

To turn the tide, the EU’s Biodiversity Strategy for 2030 will expand the existing Natura 2000 areas, implement the EU’s first-ever Nature Restoration Law, and introduce concrete measures to achieve global biodiversity targets. Success will depend on enhancing biodiversity monitoring, making better use of data and gaining a clearer picture of how nature is changing.

Addressing this urgent challenge, the EU Horizon project BMD (abbreviated for Biodiversity Meets Data) will offer a centralised platform (Single Access Point or SAP) for improved biodiversity monitoring across Europe.

Pensoft’s role

Pensoft will play a role in Biodiversity Meets Data’s impact by planning and implementing the communication, dissemination and exploitation of project results, as well as helping with the training and capacity building for BMD’s end-users, which will be led by LifeWatch ERIC. Pensoft will adopt a multi-format approach to knowledge transfer with tailored outputs for the scientific community, decision-makers, industry representatives and the general public.

Furthermore, the BMD SAP will also incorporate elements of the Biodiversity Knowledge Hub (BKH), developed under the BiCIKL project, coordinated by Pensoft.

“It’s incredibly rewarding to see the continuity in our projects, with the legacy of the BiCIKL project continuing with Biodiversity Meets Data. This seamless progression not only builds on our past successes but also ensures that our work continues to deliver long-lasting value to the biodiversity community.”
said Prof. Dr. Lyubomir Penev, CEO and Founder of Pensoft, and project coordinator of BiCIKL (abbreviated from Biodiversity Community Integrated Knowledge Library).

The BMD project consortium at the project’s kick-off meeting in early March 2025 (Leiden, the Netherlands).

International consortium

Coordinated by Naturalis Biodiversity Center, the project brings together 14 partner organisations from 11 countries to develop innovative solutions for biodiversity management.

Naturalis Biodiversity Center – the Netherlands
Royal Botanic Garden Edinburgh – the United Kingdom
Meise Botanic Garden – Belgium
Helmholtz Centre for Environmental Research – Germany
e-Science European Infrastructure for Biodiversity and Ecosystem Research – Spain
Pensoft Publishers – Bulgaria
The European Land Conservation Network – the Netherlands
University of Tartu – Estonia
Stichting Catalogue of Life – the Netherlands
The International Hellenic University – Greece
The Senckenberg Nature Research Society – Germany
The Environment Agency Austria – Austria
The National Research Council – Italy
SIB Swiss Institute of Bioinformatics – Switzerland

For more information:

Visit the BMD project website at https://bmd-project.eu/, and make sure to follow the project’s progress via our social media channels on Bluesky and Linkedin.

More than 20 journals published by Pensoft with their own hosted data portals on GBIF to streamline and FAIR-ify biodiversity research

The portals currently host data on over 1,000 datasets and almost 325,000 occurrence records across the 25 journals.

In collaboration with the Global Biodiversity Information Facility (GBIF), Pensoft has established hosted data portals for 25 open-access peer-reviewed journals published on the ARPHA Platform.

A screenshot featuring a close-up of a turtle on a forest floor, overlayed with a web portal design for biodiversity data browsing. — A screenshot of the Check List data portal.

The initiative aims to make it easier to access and use biodiversity data associated with published research, aligning with principles of Findable, Accessible, Interoperable, and Reusable (FAIR) data.

The data portals offer seamless integration of published articles and associated data elements with GBIF-mediated records. Now, researchers, educators, and conservation practitioners can discover and use the extensive species occurrence and other data associated with the papers published in each journal.

A video displaying an interactive map with occurrence data on the BDJ portal.

The collaboration between Pensoft and GBIF was recently piloted with the Biodiversity Data Journal (BDJ). Today, the BDJ hosted portal provides seamless access and exploration for nearly 300,000 occurrences of biological organisms from all over the world that have been extracted from the journal’s all-time publications. In addition, the portal provides direct access to more than 800 datasets published alongside papers in BDJ, as well as to almost 1,000 citations of the journal articles associated with those publications.

The Biodiversity Data Journal launches its own data portal on GBIF

“The release of the BDJ portal and subsequent ones planned for other Pensoft journals should inspire other publishers to follow suit in advancing a more interconnected, open and accessible ecosystem for biodiversity research,” said Dr. Vince Smith, Editor-in-Chief of BDJ and head of digital, data and informatics at the Natural History Museum, London.

Joining the @ejtaxonomy, the @BioDataJournal is the latest #ScientificJournal to launch a GBIF hosted portal! 🐟

This @Pensoft-published journal is the first of many under the masthead expected to participate in the GBIF programme. ⚡

Read more: 🔗https://t.co/IA3IWydRLy pic.twitter.com/pbulurX9Kn
— GBIF @biodiversity.social/@gbif (@GBIF) March 10, 2025

“The programme will provide a scalable solution for more than thirty of the journals we publish thanks to our partnership with Plazi, and will foster greater connectivity between scientific research and the evidence that supports it,” said Prof. Lyubomir Penev, founder and chief executive officer of Pensoft.

On the new portals, users can search data, refining their queries based on various criteria such as taxonomic classification, and conservation status. They also have access to statistical information about the hosted data.

Together, the hosted portals provide data on almost 325,000 occurrence records, as well as over 1,000 datasets published across the journals.

The Biodiversity Data Journal launches its own data portal on GBIF

With this simple website designed to lower technical demands, data managers and other stakeholders can easily focus on data exploration and reuse.

The Biodiversity Data Journal (BDJ) became the second open-access peer-reviewed scholarly title to make use of the hosted portals service provided by the Global Biodiversity Information Facility (GBIF): an international network and data infrastructure aimed at providing anyone, anywhere, open access to data about all types of life on Earth.

The Biodiversity Data Journal portal, hosted on the GBIF platform, is to support biodiversity data use and engagement at national, institutional, regional and thematic scales by facilitating access and reuse of data by users with various expertise in data use and management.

Having piloted the GBIF hosted portal solution with arguably the most revolutionary biodiversity journal in its exclusively open-access scholarly portfolio, Pensoft is to soon replicate the effort with at least 20 other journals in the field. This would mean that the publisher will more than double the number of the currently existing GBIF-hosted portals.

As of the time of writing, the BDJ portal provides seamless access and exploration for nearly 300,000 occurrences of biological organisms from all over the world that have been extracted from the journal’s all-time publications. In addition, the portal provides direct access to more than 800 datasets published alongside papers in BDJ, as well as to almost 1,000 citations of the journal articles associated with those publications.

The release of the BDJ portal should inspire other publishers to follow suit in advancing a more interconnected, open and accessible ecosystem for biodiversity research
Vince Smith

Using the search categories featured in the portal, users can narrow their query by geography, location, taxon, IUCN Global Red List Category, geological context and many others. The dashboard also lets users access multiple statistics about the data, and even explore potentially related records with the help of the clustering feature (e.g. a specimen sequenced by another institution or type material deposited at different institutions). Additionally, the BDJ portal provides basic information about the journal itself and links to the news section from its website.

A video displaying an interactive map with occurrence data on the BDJ portal.

Launched in 2013 with the aim to bring together openly available data and narrative into a peer-reviewed scholarly paper, the Biodiversity Data Journal has remained at the forefront of scholarly publishing in the field of biodiversity research. Over the years, it has been amongst the first to adopt many novelties developed by Pensoft, including the entirely XML-based ARPHA Writing Tool (AWT) that has underpinned the journal’s submission and review process for several years now. Besides the convenience of an entirely online authoring environment, AWT provides multiple integrations with key databases, such as GBIF and BOLD, to allow direct export and import at the authoring stage, thereby further facilitating the publication and dissemination of biodiversity data. More recently, BDJ also piloted the “Nanopublications for Biodiversity” workflow and format as a novel solution to future-proof biodiversity knowledge by sharing “pixels” of machine-actionable scientific statements.

A decade of empowering biodiversity science: celebrating 10 years of Biodiversity Data Journal

“I am thrilled to see the Biodiversity Data Journal’s (BDJ) hosted portal active, ten years since it became the first journal to submit taxon treatments and Darwin Core occurrence records automatically to GBIF! Since its launch in 2013, BDJ has been unrivalled amongst taxonomy and biodiversity journals in its unique workflows that provide authors with import and export functions for structured biodiversity data to/from GBIF, BOLD, iDigBio and more. I am also glad to announce that more than 30 Pensoft biodiversity journals will soon be present as separate hosted portals on GBIF thanks to our long-time collaboration with Plazi, ensuring proper publication, dissemination and re-use of FAIR biodiversity data,” said Prof. Dr. Lyubomir Penev, founder and CEO of Pensoft, and founding editor of BDJ.

“The release of the BDJ portal and subsequent ones planned for other Pensoft journals should inspire other publishers to follow suit in advancing a more interconnected, open and accessible ecosystem for biodiversity research,” said Vince Smith, editor-in-chief of BDJ and head of digital, data and informatics at the Natural History Museum, London.